Minimax semi-supervised set-valued approach to multi-class classification

Abstract

We study supervised and semi-supervised algorithms in the set-valued classification framework with controlled expected size. While the former methods can use only n labeled samples, the latter are able to make use of N additional unlabeled data. We obtain semi-supervised minimax rates of convergence under the α-margin assumption and a β-Hölder condition on the conditional distribution of labels. Our analysis implies that if no further assumption is made, there is no supervised method that outperforms the semi-supervised estimator proposed in this work: the best achievable rate for any supervised method is $O(n^{-1/2})$, even if the margin assumption is extremely favorable; on the contrary, the developed semi-supervised estimator can achieve a faster $O\big((n/\log n)^{-(1+\alpha)\beta/(2\beta+d)}\big)$ rate of convergence provided that sufficiently many unlabeled samples are available. We also show that under a smoothness assumption, the unlabeled sample cannot improve the rate of convergence. Finally, a numerical study supports our theory and emphasizes the relevance of the assumptions we required from an empirical perspective.
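The abstract does not spell out the estimator, so the following is only a minimal sketch of one plausible plug-in reading of "set-valued classification with controlled expected size": class probabilities are assumed to have been estimated from the labeled sample by any method, and the unlabeled sample is used solely to calibrate a single threshold so that the average set size matches a target. The names fit_threshold, predict_sets, and target_size are hypothetical and do not come from the paper.

```python
import math


def fit_threshold(probs_unlabeled, target_size):
    """Calibrate a score threshold on unlabeled points (illustrative assumption).

    probs_unlabeled: list of rows, each row holding estimated class
    probabilities for one unlabeled point.
    target_size: desired average number of labels per predicted set.
    """
    # Pool every class score from every unlabeled point, largest first.
    scores = sorted((p for row in probs_unlabeled for p in row), reverse=True)
    # Total number of labels we may emit across the unlabeled sample.
    budget = math.floor(target_size * len(probs_unlabeled))
    budget = min(max(budget, 1), len(scores))
    # Keeping scores >= this value emits exactly `budget` labels if no ties occur.
    return scores[budget - 1]


def predict_sets(probs, threshold):
    """Return, for each point, the class indices whose score reaches the threshold."""
    return [[k for k, p in enumerate(row) if p >= threshold] for row in probs]


# Tiny hand-made example: two points, three classes, target average size 1.5.
demo_probs = [[0.7, 0.2, 0.1], [0.4, 0.35, 0.25]]
demo_threshold = fit_threshold(demo_probs, 1.5)
demo_sets = predict_sets(demo_probs, demo_threshold)
print(demo_threshold, demo_sets)  # 0.35 [[0], [0, 1]]
```

In this sketch the labels are never needed to pick the threshold, which is one way to see why additional unlabeled data could help with the size constraint; with tied scores the realized average size may exceed the target.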

Similar resources

Semi-Supervised Boosting for Multi-Class Classification

Most semi-supervised learning algorithms have been designed for binary classification, and are extended to multi-class classification by approaches such as one-against-the-rest. The main shortcoming of these approaches is that they are unable to exploit the fact that each example is only assigned to one class. Additional problems with extending semi-supervised binary classifiers to multi-class p...

A semi-supervised approach to question classification

This paper presents a machine learning approach to question classification. We have defined a kernel function based on latent semantic information acquired from unlabeled data. This kernel allows including external semantic knowledge into the supervised learning process. We have combined this knowledge with a bag-of-words approach by means of composite kernels to obtain state-of-the-art results...

Multi-valued Approach to Near Set Theory

The aim of this paper is to introduce three approaches to near sets by using a multivalued system. Some fundamental properties and characterizations are given. We obtain a comparison among these types of approximations.

Semi-supervised Learning for Multi-label Classification

In this report we consider the semi-supervised learning problem for multi-label image classification, aiming at effectively taking advantage of both labeled and unlabeled training data in the training process. In particular, we implement and analyze various semi-supervised learning approaches including a support vector machine (SVM) method facilitated by principal component analysis (PCA), and ...

Music Genre Classification: A Semi-supervised Approach

Music genres can be seen as categorical descriptions used to classify music based on various characteristics such as instrumentation, pitch, rhythmic structure, and harmonic content. Automatic music genre classification is important for music retrieval in large music collections on the web. We build a classifier that learns from very few labeled examples plus a large quantity of unlabeled dat...

Journal

Journal title: Bernoulli

Year: 2021

ISSN: 1573-9759, 1350-7265

DOI: https://doi.org/10.3150/20-bej1313